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(57) A serv^eriess name resolution protocol ensures 
convergence despite the size of the network, without re- 
quiring an evernncreasing cache and with a reasonable 
numt)ers of Jiops. This convergence is ensure mnmgh 
a muitf-ievei cache and a proactive cache initialization 
strategy. The multi-levef cache is built based on a ctrcU"- 
lar number space. Each level contains information from 
different levels of silvers of the circular space, A mech- 



anism is Inciuded to add a level to the multf-levei cache 
when the node determrnes that the last level is fulL TIto 
PNRP includes a mechanism to allow resolution of 
nannes which are map^d onto the circular number 
space through a hash function. Further, the PNRP may 
also operate with the domain name system by providing 
each node with an identification ccmsisting of a DNS 
component and a unique number 
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DescHption 

CROSS-REFERENCE TO RELATED PATENT 
APPUCATiONS 



[0001] This patent appltcatfon claims the benefit of U, 
S. provisional patent application 60/280,896 fiJed April, 
2, 2001 , the teachings and disclosure of which are here- 
by incorporated by reference. 

FIELD OF THE INVENTION 

[0002] The present Invention relates generaSty to 
name resolution protocols, and more particularfy relates 
to peer-to-peer name resofutfon protocols. 

BACKGROUND OF THE INVENTION 

[0003] Peer to peer communlcaUon, and In fact all 
types of communication, depend on the possibHity to es- 
tablish connections between selected entitles. Entitles 
may have on© or several addresses. Indeed, these ad- 
dresses often vary as the entities move fn the network, 
because the topology changes, or because an address 
tease cannot be renewed- A classic architectural solu- 
tion to this addressing problem is thus to assign to each 
entity a stable name, and to "resolve" this name when 
a connection js needed. Thrs name to address transla- 
tion must be very robust, and It must also allow for easy 
and fast updates. 

[0004] There are two classic tvnes 

name services 

to wit, those based on the multicast, and those based 
on centralized servers. Recently, the pure peer-to-peer 
networks Gnutella and Freenet have tried to perform the 
naming function using distributed algorithms. Unfortu- 
nately, all of these algorithms have limitations, which 
limit their ability to provide a universal solution fn net- 
works approaching the size of tiie Internet. 
[00051 In the multicast architecture, the requests are 
sent to a multicast address to which all the stations in 
the group listen. The target recognizes its name, and 
responds. Examples of such services are SLP and SS- 
DR Unfortunately, multicast services involve a high net- 
working overhead, since the network must transmit 
many copies of any request. Additionally, they also in- 
volve a high conr^utlng overhead, since all the members 
of the group will receive and process all queries, only to 
discard those in which they don't recognize their own 
name. Because of these overheads, the multicast archi- 
tecture is typically only used In very small networks that 
coritain a limited number of nodes and a small number 
of links. In order to scale, the multicast protocols often 
include a provl^on for the insertion of centralized serv- 
ers, and a transition to a centrafized mode when a server 
Is present. 

[0006] In such a centralized architecture, the requests 
are processed by a centralized server whose database 
contains the mapping between names and addresses. 
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The domain name service (DNS) used today In the In- 
ternet combines a centralized root with a network of 
servers, organized to resolve hierarchical names. Un- 
fortunateiy, centralized and semicentralized services 
5 h a ve proven to have several kinds of weaknesses. First, 
because all trust relies on the central serv^, i^dating 
information requires strong controls, in practice, central- 
ized servers have difficulties coping with the load, and 
can only work If a large fraction of the queries are solved 
10 by means of caches. Old copies of the name to address 
resolutions linger in these caches, however. vi^Ich 
makes fast updates difficult. Further, the centralized 
server Is a point of political, legal and commercial con- 
trol. These controls can interfere with the rellabiiify of 
the service. One may be tempted to dismiss these weak- 
nesses as mere scaling Issues, but it Is very clear that 
they derive directly from the use of centralized slices. 
[0007] In GnuteSa. there Is no central database. Eadh 
node "knovi^s" about a set of named objects. A global 
20 search Is performed by executing parallel searches on 
the neighboring nodes within a spedfted "radius" and 
merging the results. This form of spreading trades mem- 
ory, the footprint of the database on each node, for mes- 
sages and computation. If the database is partHbned in 
25 p components, for example, then each request will re- 
quest at least P messages and fill trigger searches In at 
least P nodes. If the dataset is limited in size, then the 
number of components P is entirely a function of the re- 
lation between the size of the dataset and the maximum 
^0 size S that a given node can store. In that case, the sys- 
tem scales if the riumber P of components is basically 
a constant. However, as the number N of nodes increas- 
es, the number of copies of a given component grows 
as 0(N/P). which is equivalent to 0(N), As such, the 
35 number of searches grows as me number of nodes, 0 
(N). Therefore, the number of searches that a given 
copy of a component must process scales as the 
number of searches divided by the number of copies. 
As both numbers grow linearly with N, the number of 
^0 searches per copy remains constants 

[0008] Unfortunately, In a name s^er appilcatk>n 
both the size of the database and tie number of search- 
es grow linearly with N, the number of members. This 
presents a scaling problem. Specifically, there will be 0 
■'s (N/P) copies of any components, and 0(N} searches per 
unit of time. As such, each node will have to send 0{P) 
message per search. Since each component will be 
searched 0(N) time, each copy will be searched (0(N}/ 
0(N/P)) = 0(P) times. If there is a maximum size S for a 
so given component, limited by the available memory, then 
P must grow as 0{N/S). If we assume that S fs constant, 
then P must grow as 0(N). Thus, the number of searches 
that each node processes and the number of messages 
that each node sends and receives will both grow as 0 
55 (N). In short, if the dataset grows as the number of 
nodes, then a simple partitioning strategy cannot be 
sustained. In fact, a surge in Gnutella demand during 
the N APSTE R trial caused the system to collapse. Latw; 
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the surge In demand caused the average traffic to ex- 
ceed the capacity of modem finks, which in turn caused 
tile Gnuteiia system to splinter In a set of disconnected 
networks. 

[0009] Freenel Is a "peer to peer" network that organ- 
izes ilself with an orgaric algorithm. The purpose of the 
network Is to distribute documents, identified by a binary 
identifier A search for a document will result in a re- 
quest, propagated to a neighbor of the requesting node 
as illustrated In FIG. 8. !f this neighbor does not have a 
copy of the document, it forwards the request to another 
neighbor, and soon. Ef the document is found, each node 
tn the path, Intum, gets a copy, until finally acopy arrives 
at the initial requester. Also, there are cases in which no 
copy wiif be found, and the search will fail. Nodes that 
fonward searches do not select a neighbor entirely at 
rarKlom. They compare the document's identifier to oth- 
er Iden^ers that where previously served by the neigh* 
bors and stored in their routing tabie. Information stored 
includes a unique number, the address, and a certificate 
for these neighbors. The node then selects the "closest" 
neighbor whk^ prevkjusly served documents wfx>se 
identifi®^ were most similar to the searched Identifier 
According to the authors of this algorithm, nodes that 
receive successive requests for similar documents will 
accumulate a "cluster" of such documents. As such, the 
most popular documents wifJ tend to be copfod near the 
pJace where they are needed. 

{OOlOJ Freenet nodes maintain a "routing table" that 
associates document identifiers and the identification of 
neighbors from which a document was received. The 
routing tables are updated as a by-product of the retriev- 
ai process, r,e, when a request is successful, each node 
in the path enters in the table an entry linking the docu- 
ment Identifier and the neighbor node from which the 
documeni was rece?ved, \n a real life environment, there 
are limits to the practical size of the routing table. Once 
the iimit is reached, nodes will have to select the entries 
that they intend to keep, or drop. When the limit is 
reached, a new input will replace the least rec^tiy used 
entry. 

[OOtlJ When a document fs sought, the node looks 
up the nearest key in its routing table to the key request- 
ed and forwards the request to the corresponding node. 
Jn Freenet, the key is a 160-bit rMjntier. The routing table 
to find the best ^Ited neighbor If this neighbor is al- 
ready listed in the path, the next one is selected, etc. If 
the search In the routing table is Inconclusive, and if 
there are neighbors that were not already visited, one 
of these neighbors will be selected. If there is no avail- 
able neighbor, the request is sent back to the previous 
node In the path, which can then try a better fit. If the 
request has rolled back ^1 the way to ^e sender and 
there Is no new neighbor, or if the maximum number of 
hops has been exceeded, a failure is declared. 
10012] The use of the Freenet algorithm to provide 
name service in networks containing. In first aii^roxima- 
tion, exacQy one name per ncde in an environment in 



which each node publishes exactly one document illus- 
trates the learning effect and Its limitations. For exam- 
ple, the learning process is quite slow, indeed, the team- 
ing effect varies widely based on several factors. First, 

^ the shape of the graph Influences this process. A graph 
that is more connected yields better results. The number 
of hops allowed for a given request also plays a sub- 
stantial rote in the learning process. If ^at number is too 
small, the results are dramaticalfy worse. The size of the 

10 cache In each node is a factor as is the size of the net- 
work. 

[0013] The success rates achieved through the use 
of the Freenet algorithm vary for various networic sizes, 
after alJowing time for network learning. If the average 

^5 number of neighbors per node is assumed to be 5, the 
requests are allowed to visit up to 256 nodes, and each 
node is able to cache up to 512 entries, the effect of the 
network size becomes quite dramatic. Past a certain 
size, the learning process stops working all together. On 

2£> a 10,000 node network, for example, the success rate 
drops to about 40% . In short, the Freenet algorithm does 
not scale welL 

[00t4J There exists, therefore, a need in the art for a 
naming protocol, to the scale of the Internet, which can 

^5 define the management of at least 10 bJIHon name^to- 
address mappings. A preferred solution should be fully 
decentralized, self-tinlng and efficient ft should also 
provide a high fevel of security. However, as the above 
discussion makes clear, none of the existing technolo- 

^ gfes fHovldes such a protocol. 

BRIEF SUMMARY OF THE INVENTION 



[001 5} The Inventive concepts disclosed in this appli- 
es cation involve a new name resolution protocol that can 
operate In the absence of any centralized server This 
new peer*to-peer. s^^-less name resolution prcrtocol 
ensures convergence despfte the size of the network, 
without requiring an ever-Increasing cache and with a 
^0 reasonable numbers of hops, 

[0016] As discussed above, pure peer4o-peer net- 
works, such as Gnutella and Freenet, use distributed al- 
gorithms to perform the naming function. Unfortunately, 
these algorithms cannot guarantee convergence as ^e 
« size of the network Increases. That Is, they cannot guar- 
antee convergence without lineariy increasing the size 
of the cache with the size of the network, and without 
extending the number of hops that are allowed to an un- 
reasonable nun^r 
^0 [00171 The server-4ess or peer-^to-peer name resolu- 
tion protocol of the instant invention solves these prob- 
ferns and ensures convergence in large networks 
through two mechanisms: a multilevel cache and a 
proactive cache initialization strategy. The mulfelevel 
55 cache allows the protocol to adapt to networks of various 
sizes, and grows only as the logarithm of the size of the 
netwcH-k (not linearly as required by prior peer-to-peer 
protocols). The multilevel cache is built based on an un- 
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derfying concept of a circular number space. Each level 
In the cache contains information from different levels 
of slivers of the circular space. The number of levels fn 
the cache is dependent on the size of the netvi/ork to 
which it attached. However, since this size \s not 
known, a mechanism is included to add a fevei to the 
multflevel cache when the node detemiines that the last 
lever is fuif. In this way, rapid convergence is assured. 
[0018] As a first extension to the peer-ta-peer name 
nesoJution protocol, a mechanism to altow resolution of 
names fs also presented. These names are mapped on- 
to the circular number space through a hash funcfion. 
However, recognizing that there may be multiple entries 
for a single hash value (e,g. in large groups of 10.000 
members), a unique number is associated with the hash 
of the name as <hash>.<unjque number> {<M>.<N>). 
With this extension, the core protocol of the Instant In- 
vention may be used for names as we« as numbens. 
10019] The second extension to the base protocol of 
the present invention provides a real world integration 
of the peer-ta-peer resolution protocol with the domaan 
name system. By providing each node wi^ an identifi- 
cation consisting of a DNS component and a unique 
number, the DNS mechanfsm can be used to Focate a 
server for that DNS component; this can be either a local 
DNS resolver that also has knowledge of PNRR or a 
centralized server that manages the spedHed DNS 
component. This server may then go into the peer-to- 
pe&r name resolution protocol (PNRP) space using the 
protocol of the present invention with the unique number 
portion to find the particular node, and return that infor- 
matlon to the requester. The individual node can find a 
neighbor to help seed its cache by sending a request to 
the centralized server with a random number. 



ed to include name-to-number mappings in accordance 
with the present invention; 

[0027] FJG. 7 is a simplified graphical Illustration of the 
domain name service (DNS) and peer to peer space II- 
5 fustrating cross-over application of the system of the 
present invention between these two spaces; and 
[0028] FJG. 8 is a graphical ilJustration of a peer-to- 
peer space. 

[0029] Whife the invention will be described in con- 
nection with certain prefen-ed embodiments, there is no 
intent to limit it to those embodiments. On the contrary, 
the Intent is to cover all alternatives, modifications and 
equivalents as Included within the spirit and scope of frie 
invention as defmed by the appended claims. 

15 

DETAILED DESCRIPTION OF THE INVENTION 



BRIEF DESCRIPTION OF THE DI^WINGS 

[0020] The accompanying drawings incorporated in 
and foming a part of the speci^tion illustrate several 
aspects of the present invention, and together with the 
description serve to explain the principJes of the inven- 
tion, in the drawings: 

[0021] FiG. 1 is a block diagram generaliy illustrating 
an exemplary computer system on which the present 
invention resides; 

[0022] RG. 2 is a graphical representation of the cir- 
cular number space of the present invention; 
[0023] RG 3 Is a graphical illustration of the average 
number of hops expected for convergence with the sys- 
tem of the present invention; 

[0024] FiG. 4 is simplified llfustratton of the muftifevel 
cache of the present invention; 

10025] FIG, 5 Is a graphical illustration of the number 
of hops versus cache partrtion size for several network 
sizes to reach convergence with the system of the 
present invention; 

[0026] FiG, 6 is a graphical representatfon of the cir- 
cular number space of the present invention as expand- 



[0030] Tu m ing to the drawings, wh^ein like reference 
numerals refer to like elements, the invention is iilustrat- 
2f ed as being Implemented in a suitable computing envi- 
ronment Although not required, the Invention will be de- 
scribed in the general context of computer-executable 
instructions, such as program modules, b^ng executed 
by a personal computer. Generally, program modules 
^5 include routines, programs, objects, components, data 
structures, etc. that perform particular tasks or imple- 
ment particular abstract data types. Moreover, those 
skflied in the art will appreciate that the invention may 
be practiced with other computer system conngurations, 
30 including hand-h^d devices, mt^fi-processor systems,' 
microprocessor based or programmable consumer 
electronics^ network PCs, minicomputers, mainframe 
computers, and the like. The invention may also be prac- 
ticed In distributed computing environments where 
3ff tasks are performed by remote processing devices that 
are finked through a communications network. \n a dis- 
tributed computing environment, program modules may 
be located in both local and remote memory storage de- 
vices. 

*o [0031] Figure 1 illustrates an example of a suitable 
computing system environment 1 00 on which the inven- 
ffon may be implemented. The computing system envi- 
ronment 100 is only one exanrpie of a suitable comput- 
ing wvlronment and Is not Mended to suggest any llm- 
^ itatfon as to the scope of use or functionality of the in- 
vention. Neither should the computing environment 100 
be interpreted as having any dependency or require- 
ment relating to any one or combination of components 
illustrated in the exemplary operating environment 100. 
50 [0032] The Invention is operational with numerous 
other general purpose or special purpose computing 
system environments or conffgurations. Examples of 
well known computing systems, environments, and/a- 
configurations that may be suitable for use wjth the In- 
55 v^tion Inciuder but are not Unrated to. personal comput- 
ers, server computers, hand-held or laptop devices, 
multiprocessor systems, microprocessor-based sys- 
tems, set top boxes, programmable consumer electron- 



4 



EP 1 248 441 A2 



8 



ics, network PCs, minrcomputers, marnframa comput- 
ers, (distributed computing environments that include 
any of the above systems or devices, and the fike. 
[0033] The rnvention may be described in the general 
context of computer-exectiteble instrucUons, such as 
program modules, being execajted by a computer Gen- 
erally, program modufes include routines, programs, ob- 
jects, components, data structures, etc. that perform 
particular tasks or implement particular abstract data 
types. The invention may ateo be practiced in distributed 
computing envlronmente v^^ere tasks are performed by 
remote processing devices that are (inked through a 
communications network. In a distributed computing en- 
vironment, program modules may be located In both lo- 
cal and remote computer storage media including mem- 
ory storage devices. 

[0Q34] With reference to Rgure 1, an exemplary sys- 
tem for implementing the invention includes a general 
purpose computing device in the form of a computer 
110. Componentsofcomputer 110 may include, but are 
not limited to. a processing unit 120, a system memory 
130, and a system bus 121 that couples various system 
components indudlng the system memory to the 
processing unit 120. The system bus 121 may be any 
of several types of bus stnjcfures including a memory 
bus or memory controller, a peripheral bus, and a locat 
bus using any of a variety of bus architectures. By way 
of example, and not iimitation, such architectures in- 
clude Industry Standard Architecture (ISA) bus. Micro 
Channel >W-chitecturB (MCA) bus, Bnhance<i ISA (EISA) 
bus. Video Electronics Standards Associate (VESA) lo- 
cal bus. and Penpheraf Component Interconnect (Pcl) 
bus also known as Mezzanine bus. 
{00351 Computer 110 typically includes 8 variety of 
computer readable media. Computer readable media 
can be any available media that can be accessed by 
computer 110 and incfudes both vofatiJe and nonvolatile 
media, removable and non-removable media. By way 
of example, and not limitation, computer readable media 
may ODmprfse computer storage media and corranuni-- 
cation media. Computer storage media Includes both 
voiatfle and nonvolatile, removable and non-removable 
media implemented in any method or technoiogy for 
storage of infomiation such as computer readable In- 
sfmcfions, data structures, program modules or other 
data. Computer storage media includes, but Is not lim- 
ited to, RAM, ROM, EEPROM, flash memory or other 
memory tecfmology, CD-ROM, digital versatile disks 
(DVD) or other optical disk storage, magnetic cassettes, 
magnetic tape, magnetic disk storage or other magnetic 
storage devices, or any other medium which can be 
used to store the desired Information and whic^ can be 
accessed by computer 110. CommunlcaHon media typ- 
ically embodies computer readable instaictions, data 
staictures, program modules or other data in a modu- 
lated data signal such as a carrier wave or other trans- 
port mechanism and indudes any Informatkin delivery 
mecfja. The term "modulated data signal" means a sfg- 
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nal that has one or more of its characteristics set or 
changed in such a manner as to encode information In 
the signal. By way of example, and not limitation, com- 
munication media Includes wired media such as a wired 
netvrark or direct-wired connection, and wireless media 
such as acoustic, RF, infrared and other wireless media. 
Combinations of the any of the above should also be 
included within the scope of computer readable media. 
[0036] The system memory 130 Includes computer 
storage media In the form of volatile and/or nonvolatile 
memory such as read only memory (ROM) 131 and ran- 
dom access memory (RAM) 132. A basic input/output 
system 133 (BIOS), ccmtaining the basic routines that 
help to transfer Infomiation between elements within 
computer 110, such as during startup, is typically stored 
In ROM 131. RAM 132 typically contains data and/or 
program modules that are immediately accessible to 
and/or presenW^/ being operated on by processing unit 
120. By way of example, and not limitation. Figure 1 il- 
lustrates operating system 134, application programs 
135, ofhcM^ proqrmn modules 136, and program data 
1 37^. 

[0037] The computer 110 may also Include other re- 
movable/non-removable, volatile/nonvolatile computer 
storage media. By way of example only, Figure 1 illus- 
trates a hard disk drive 141 that reads from or writes to 
non-«movabie, nonvolatile magnetic media, a magnet- 
ic disk drive 151 that reads from or writes to a remova- 
ble, nonvolatile magnetic disk 152, and an optical disk 
drive 1 55 that reads from or vwites to a removable, non-^ 
vQi^tjla optk^al disk 158 as a CD ROM or other op- 
tical media. Other removable/nonremovable, volatile/ 
nonvolatile computer storage media that can be used in 
the exemplary operating environment include, but are 
not limited to, magnetic tape cassettes, fia^ memory 
cards, digital versatile disks, digital video tape, solid 
state RAM, solid state ROM, and the like. The hard disk 
drive 141 is typMcatly connected to the system bus 121 
through a non-removable memory interface such as in- 
terface 140, and magnetic disk drive 151 and optica 
d(sk dnve 155 are typically connected to the system bus 
1 21 by a removable memory interiaoe, such as Interface 
150. 

f0038J The drives and their associated computer stor- 
age media discussed above and illustrated in Figure 1, 
pn^vide storage of computer readable instructions, data 
structures, program moduies and other data for the 
computer 110. In Rgure 1, for example, hard disk drive 
141 re thustrated as storing operating system 144 ap- 
plication programs 145, other program modules 146 
and program data 147. Note that these components can' 
either be the same as or different from operating system 
134, application programs 135, other program modules 
136, and program data 137. Operating system 144 ap- 
plication programs 145, other program modules 146, 
and program data 147 are given different numbers here- 
to illustrate that, at a minimum, they are different copies. 
A user may enter commands and information Into the 
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computer 1 1 0 through Input devices such as a keyboard 
162 and pointfng device 161, commonJy referred to as 
a mouse, trackball or touch pad. Other input devices (not 
showr>) may include a microphone, joystick, game pad, 
saleifite dish, scanner, orttie like. These and other Input 
devices are often connected to the processing unit 120 
through a user Input interface 160 that is coupled to the 
system bus, but may be connected by other int^ca 
and bus structures, such as a parallel port, game port 
or a universal serial bus (USB). A monitor 191 or other 
type of display device Is also connected to the system 
bus 121 via an interface, such as a video interface 190, 
in addition to the monitor, computers may also include 
other peripheral output devrces such as speakers 197 
and printer 196, which may be connected through a out- 
put peripheral interface 195. 

[00391 The computer 110 may operate in a networked 
environment using logical connections to one or more 
remote computers, such as a remote computer 180. The 
remote computer 180 may be another personal compu- 
ter, a server, a router, a network PC, a peer device or 
other common network node, and tyj^calfy Includes 
many or all of the elements described above relative to 
tfie personal computer 110, although only a memory 
storage device 181 has been illustrated in Figure 1. The 
logical connections depicted In Figure 1 indude a local 
area network (LAN) 171 and a wide &ea nefwwk (WAN) 
173, but may also include other networks- Such net- 
vvorking environments are comnrKMipface in offices, en- 
terprise-wide computer networks, Intranets and the In- 
temet. 

[0040] When used in a LAN networking environment, 
the personal computer 110 is connected to the LAN 171 
through a network interlace or adapter 170. When used 
rn a WAN networking envin>nmen1, the computer 110 
typically includes a modem 172 or other means for es- 
tablishing communicatrons over the WAN 173, such as 
the Internet. The modem 172, which may be internal or 
external, may be connected to the system bus 121 via 
the user input interface 160, or other appropriate mech- 
anism, in a networked en>rfronnnent, program modules 
depicted relative to the personaJ computer 110, or por- 
tions thereof, may be stored In the remote memory stor- 
age device. By way of example, and not JImltatfon. Fig- 
ure 1 iliustrates remote application programs 166 as re- 
siding memwy device 181. It will be appreciated that 
the network connections shown are exemplary and oth- 
er means of establishing a communications fink be- 
tween the computers may be used. 
[0041] In the descnptlon that follows, the invention will 
be described with reference to acts and symbolic repre- 
sentations of operations that are perfwmed by one or 
more computer, unless indicated otherwise. As such. It 
will be understood that such acts and operations, which 
are at times referred to as being computer-executed, in- 
clude the manipulation by the processing unit of the 
computer of electricaJ signals representing data in a 
structured form. This manipulation transforms the data 



or maintains it at locations In the memory system of the 
computer, which reconfigures or otherv»dse alters the op- 
eration of the computer in a manner well understood by 
those skilled in the art. The data structures where data 
^ Is maintained are physical locations of the memory that 
have particular properties defined by the format of the 
data. However, while the invention is being described in 
the foregoing context. It is not meant to be limiting as 
those of skill in the art will appreciate that various of the 
fo acts and operation described hereinafter may also be 
implemented in hardware, 

[0042] As illustrated above, establishing peering rela* 
tions is an expensive process In FrBenet. This forces a 
relatively stafk; graph, in which requests and files can 
IB only be fbnwarded along preexisting associations. How- 
ever, the response rate improved when the number of 
associations per peer increased. This suggests that bet- 
ter results may be obtained if the peers where allowed 
to spontaneously build relations. /Mother observation is 
20 that an LRU management of the knowledge table may 
well be counterproductive. The clustering efTect may oc- 
cur, but it occurs in a haphazard way. Arrival of new In- 
formation, through the results of queries, may in fact 
drown the useful knowledge in useless noise. In the sys- 
25 tern of the present invenfion, the knowledge manage- 
ment privileges the keys that are most similar to the keys 
of the local peer. 

[0043] in an embodiment of the system of the present 
invention, each node accumulates a rou^ng table that 
30 contains a list of references to other nodes in the net- 
work. For each node entry, address Information, which 
may include a node identification, address, the key of 
the node, and the distance between the key of this node 
and the key of the locai node are obtained. Each time 
35 the local node learns about a remote node, it chedts 
whether the node is already known, and if not whether 
to enteran entry In tiie rouSng table. This will always be 
the case as long as the number of entnes has not 
readied the size limit. When the limit is reached, any 
'^0 new entry will replace an existing entry; there are'sev- 
eral implementations strategies, such as trying to find 
well spaced entries in the name space, trying to select 
entries that are in close pnoxffnity to the local node, or 
SHTiply selecting at random an existing entry to be re- 
placed. This way, the routing table will always be re- 
freshed to contain active neighbors. The repiacemeni 
mechanism does not affect the direct neighbors. These 
neighbor entries are created when the graph is set up. 
This restrfctk>n is used in one embodiment in order to 
so maintain some degree of connectivity. In a preferred em- 
bodiment, the replacement strategy is random in each 
cache level, rather than based on a utility measure. 
Each entry has an Ideal cache lever determined by its 
'distance' from the cache owner. New entries may only 
^5 be added to the cache level corresponding to their dis- 
tance, or to the lowest level if the entry^s Ideal cache 
level' has not been breached yet. 
[Q044] In the system of the present Invention, when a 
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node receives a query it searches for the entry i n its rout- 
ing table whose key best matches the target, excluding 
the nodes that have already been visited. The query Is 
then forwarded directly to the node that advertised the 
entry. If there ts no adequate entry, the request is sent 
back to the node from which the request was received; 
this node will try anoihca- entry in its own routing table. 
The request Is successful If it reaches the entry whose 
key matches the target, ft is unsuccessful If the target Is 
not reached fn the maximum number of steps, or if the 
node from which the request was received tries all pos- 
sfble nelghl>ors and receives a negative response. In the 
case of successful requests, the response is relayed by 
a\\ intermedfate hops. It carries the address of the node 
that held the target key, and this entry can be inserted 
in the routing tabies of the rntemiedlate nodes. 
[Q045i In one embodiment the node Identifiers are 
rsndomiy sefected integers, e;q3ressed in complement 
to 2 notation. With this notation, In an n-bit fonnaf, the 
numbers vary between -2("-i> and 2(n-"')-'1. The distance 
function calculates the absolute difference between two 
integers. This provides for a crrcular space in which the 
distance between the extreme points, -2<""^> and 2f"-i)- 
1, is 1. An iJiusfra^n of this drcufar number space 182 
is provided in FIG. 2, In a network of H nodes, N ran- 
domly spaced identifiers in that circular number space 
are chosen. If a node accumulates a cache of K entries 
containing the identifiers of the K nodes whose identifi- 
ers are closest to its own Identtflerr that cache may be 
viewed as covering a sliver of Ihe drcular space whose 
average angular magnitude would be a=27cK/N. 
{0046] When a node of identffier X receives a query 
for an fdentiiler Y, the angular distance between X and 
Y may be expressed as p^27i|X-Y|/2", If p is lower ti^n 
a/2» then Y should be one of the K tithes in the rocai 
cac^e since tiie node will know of approximately all of 
the peers wfthin this dose sliver, as wiil be discussed 
below. The target has been found. In the other cases, 
the node wilJ ^SkM the entry that is closest to Y. That entry 
will be one of the edges of the sliver, which means that 
at the next step (hop)» the angular distance will be re- 
duced on average by a/2. At this next hop (having an 
Identifier of X', this process will be repeated, until the 
target is found. As illustrated in FIG. 2, an additional hop 
to a node having an Identifier of X" is required to find the 
desired target Y The maximum value of p is tt, cwre- 
sponding to a distance of S*"^^), over which an average 
of N/2 entries will be found. Each step (hop) reduces the 
angle by cx/2, corresponding to K/2 envies. After a suf- 
ficient number of steps (hops), the request is assured to 
reach the target. Since a/2 corresponds to ttK/W, and 
since the maximum angle is n the ma)dmum number of 
steps is: H - ;r/(a/2) = N/K. Upon success, each node 
wIR acquire information about at least one node In suc- 
cessive slivers of the circular number space. This Infor- 
mation is used to build a multilevel cache (described be- 
low) having a hierarchy of ever dense knowledge. 
10047] This Illustrates one reason why the system of 



the present invention is successful. However, it also 
shows that, to maintain a small hop count, the size of 
the cache must grow linearly with the size of the net- 
work. This is not acceptable. In order to be practical, the 
5 system must scale better than linearly A solution Is to 
use the muHileve! cache, i,e. spirt the routing cache in 
two parts, one containing the entries whose keys are 
nearest to the local key, another containing entries se- 
lected at random. A? an Illustration, suppose that the 
^0 first cache contains K1 entries, and the second one K2. 
The first node that processes a request will select the 
entry whose key Is dosest to the target. If that entry Is 
in the set K1 (corresponding to an angle of a), the 
processing is complete, if It is in the set K2, the maxi- 
ms mum distance between the target X and ^e selected 
entry Y will be at most half the distance between two 
entries in the set K2. If the entries were equally spaced, 
the distance would be 2<"''^)/K2, and the con-esponding 
angle would be jc/K2. At the next node the query will 
be processed as discussed above. Each hop win see an 
angular reduction of at least ce/2, where a=2jtK1/N. The 
number of hops wlil thus be H -1+2.p/a " 1+N/{K1 .K2). 
For a given total number of entries to K=:K1+K2, the 
smallest value of H is obtained when K1 =K2=K/2. 1 n this 
case, H = 1 + N/(K/2)2. 

[0048] The size of the Identifiers may vary with the 
size of the network and need not be expressed In com- 
plement to 2 notation. However, a key requirement of 
the protocol is that node identifiers can be treated as 
30 Integers, varying between a minimum value NMIN and 
a maximal value NMAx. fn this space. DMAX may be 
defined as the maximum distance between tvro points, 
such that DMAX = (NMAX-NMIN)/2. As such, the dis- 
taice D between two Identifiers X and Y is defined as 
fdlowsr (1) if X <Y, then swap X and Y; (2) if ((X-Y) 
<DMAX), then D = X-Y, else D - (NMAX +1 ^ X) ^ (Y - 
NMIN). As Indicated above, If numbers are stored in fcH- 
nary using complement to 2 logic, then the distance can 
be computed as the absolute value of the difference be- 
<o tween X and Y. 

[0049] When processing queries, the cache is used 
to find the most suitable next hop. This Is done by first 
finding the subset of cache entries whose address is not 
already listed in the fist of relays. If the subset is empty, 
an irxiication of failure is returned. However, If the subset 
contains exactly one entry, Xh^X entry is returned. Oth- 
erwise, two entries whose identifier Is closest to the tar- 
get of the request are found. These entries may be 
named A and B, and their respective distance to the tar- 
so get may be named DA and DB. The protocol of the 
present invention then picks at random between A (with 
w^ght DB) and B (with weight DA), and this random pick 
is returned. 

[0050] In an alternate embodiment of the present in- 
55 ventlon. the cache may b© divided Into an arbitrary 
number of parts. For example, if we have P parts, we 
win have a first set containing K/P entries. The largest 
angle should be of order of p= P/2K. The next hop 
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will use the best match rn the next level set, which will 
contain K/P entries, spread on an angle of size P/K, After 
that hop, the r^duaf angle will be b' = P^f2KK This wHI 
continue until the last set fs reached, at which point the 
angle will be reduced, at each set. by cc/2=*K/(PN), The 5 
maximum number of hops will thus be H = P-1 + WiKf 
P)P. 

[0051J RG. 3 shows me e)qDected number of hops for 
a 500 entries cache and different values of P, The key 
point rflusfrated this figure is that it predicts that even if io 
the size of the network grew to lO^o entries, the requests 
would be solved in 6 hops or less, rf the cache was par- 
titioned in 5 ore data sets. For smaller networks, sifghtfy 
better results may be achieved with a iess^ number of 
partitions. In practice, the optimal number of partftrons 16 
wlir vary with the size of the cache and the expected size 
of the network. In the above computations, it isassumed 
that, m each data set, the keys are regularly distributed 
along the number space, fn networks where this is not 
necessarily taie, the system can obtain the same effi- 20 
ciency by aliowhng twice the number of entries. In one 
emt}odiment the nodes only learn the address and iden- 
tifiers of the nodes that were sending requests. In an 
alternate embodiment, the nodes that process each re- 
quest also learn the address and id^ifier of the re- 25 
sponder. in yet a further embodiment, in the case of 
failed requests, the nodes also learn ttie address and 
Identifier of the stations whose Identifier was closest to 
the target, even if the target was not present in the net- 
work. 3(j 
[00521 Ttifs multilevel cache 1S4 may be visuaiized as 
Illustrated in RG. 4. As may be seen, each ievet of the 
cache includes an indication of the MJN and MAX iden- 
tifier definingthe bounds of that Jevel. This MIN and MAX 
value Is determined for the initial level as MIN=X-N/2 35 
and MAX=X4-NJ/2, where N is the size of the number 
space and X is the local ID, Witiiin each level are the 
entries known by the node. Within successive levels, the 
MIN and MAX are defined as {L"1)/K, where L Is the 
number of the level. 40 
[0053J In an embodiment of the present inventronp a 
proactive cache buiid up strategy is used, in which each 
node, when rt connects to the network, sends a succes- 
sionof gratuitous requests ibrsttategfcafiy located iden- 
tifiers, in simuiatrons with 1000 nodes, 9 such requests 4^ 
have proven sufffcient to populate the caches, so that 
ail the queries sent during this simulation were served 
immediately, using in average 3 to 4 hops- However, 
more or fewer such requests may be utilized, 
[0054J The mufti-levef cache ts sto/ctured as a set of 50 
L levels, each holding at most K entries as Illustrated in 
FIG. 4. The number of levels in the cache will be a func- 
tion of the size of the network and of the number of en- 
tries in each partition. This Is problematic ance ^e 
nodes do not know a prior/ the size of the network to S5 
which they attach. For this reason, the nodes dynami- 
cally add a level to their cache If they find out that the 
**last lev^" is fuii. The number of entries in the last level 



of the cache is in fact a good prediction of the size of 
the network. This level is supposed to contain a com- 
plete sampling of the nodes whose identifiers fall in an 
interval of size 2''DMAX/{K(^'% If the number of levels 
in the cache Is dynamic, then me only parameter that 
must be chosen is the number of entries per cache level . 
The chok:e of this parameter is a compromise between 
the efficiency of the query resolution procedure and the 
amount of ancillary tratfic required to set up the cache 
content. FIG, 5 shows how the average number of hops 
required to solve a query varies as a function of the size 
of the cac^e. The computation assumes that the data 
are distributed randomly, and that the bounds of the 
cache for each level are computed as specified herein, 
i.e. dividing the covered size by K/2 at each level. 
10055] In one embodiment, the value of K is set to 20, 
although this value may be set to other values depend- 
ing on network size, hop count iimits, etc. Each level of 
the cache is characterized by a maximum distance to 
the focal identifier The distances are a tunctlon of 
□MAX, the maximum distance between two valid iden- 
tifiers. DiWAX is a function of the nun^er space, of a 
coefTicient P equal to N/2, and of the cadie level. The 
last cache level contains enMes whose distance to the 
local kientlflleris smaller than or equal to DMAX/(P(L- D). 
The first cache level contains entries whose distance to 
the local identifier Is larger than DMAX/R The other 
cache level contains entries whose distance to the locai 
identifier is largerths^ DMAX/(P4, where L is the vaiue 
oftheleveL 

(0058] When a riode reams about a new entry, it tries 
to insert it in the cache. To do this, it performs the fol- 
lowing steps. First, if the entry is already in the cache, 
the certificate of that entry is replacedi ts/ the newly 
learned value, if that value is most recent. Second, if the 
entry is not in the cache, the distance between the local 
idCTitlfier and tiro entry identifier ts computed. This is 
used to determine the level at which the entry should be 
cached. If the seized level is the last level of the cache 
currently existing, and if there are K or more entries in 
the cache for that level, tien a new level is added (set 
L - L+ 1), The entries at level L are then divided between 
these two levels according to their distance to the local 
ID. The selected level for tfie new entry is then reas- 
sessed. This process fs repeated if necessary. If, how- 
ever, there are less than K entries in the cache Ibr the 
selected level, the new entry Is simply added. If there 
are K entries In the cache for the selected level, and if 
me selected level Is not the last level of the cache cur- 
rently existing, a replacement algorithm is implemented 
to determine whether the new entry should replace an 
existing entry, and if so, which entry tt should replace. 
The simplest ngpJacement algorithm is a "random re- 
placem^t^ i.e. select at random one of the K cache 
entries and replace it by the newly learned vaiue. Finally, 
if the new entry was added to the last level, a flooding 
algorithm discussed below is performed. 
I00S7] When a node adds an entiy in the last level of 
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its cache as just discussed, or if it replaces an existing 
entry wfth a more recent value, the node engages in a 
flooding procedure. To accomplish this procedure, the 
node prepares a flooding message containing the ad- 
dress certificate of the local node, with an empty itst of 
already flooded nodes. This message ts then sent to the 
address of the new entry. A list of the nodes in the cache 
whose distance to the new entry fs smafier than DMAX/ 
(P(L-1)) is then prepared, if the addition of the new entry 
was a resuft of a flooding message, the nodes that 
marked as already flooded are removed from the list. 
The node then prepares a flooding message containing 
the address certificate of the new entry. The fist of al- 
ready flooded nodes is set to contain the local node, all 
the nodes in the nst, and, if the addition results from a 
flooding message, alt the nodes marked as already 
flooded In fiiat message. A copy of frie message Is then 
sent to all the nodes in the list Nodes with limited ca» 
pacity may opt to restrict the size of the list of "flooding 
targets." If they do so. they should retain in the list the 
nodes whose identifier is closest to the local identifier- 
[00581 As indicated above, cache entHes are repre- 
sented by an address certificate that contains a date of 
validity* To maintain only current information about the 
other nodes in the network, and to reduce the clutter of 
obsolete data, cache entnes are removed from the 
cache when the date of validity fs passed in one embod- 
iment of the fH-esent invention. Each node tt^at partici- 
pates In the network In this embodiment therefore, reg- 
ularly renews rts address certificate. Upon renewal, the 
new value is flooded, as if the nod^ had learned a new 
entry In the fast level of its cache. Thrs process ensures 
that Its entry in the caches of other nodes will not be 
removed as obsolete. 

[0059] At regular intervals, the nodes preferably ex- 
amine the structure of thefr cache, and make sure that 
each cache level ts reasonably complete. The last cache 
level Is deemed to be always complete; any oth^ cache 
levol is not complete if Hie largest interval between two 
cache entries at this level is larger than 2*DMAX/(P*), 
where I is the value of the level. If this is the case, the 
node will format a request message whose target is 
paced in the middle of the largest interval, and wili en- 
gage in a standard processing of this request 
[0060] As illustrated above , the system of the present 
invention can handle networks of arbitrary sizes. How- 
ever, each node must have a correctly inmafized cache. 
As In Freenet, nodes learn about other nodes as a side 
effect of the regular processing of requests. Unfortu- 
nately, merely relying on the natural learning process Is 
not sufficient to achieve reliable performance. There- 
fore, &ie system of the present invention complements 
the natural learning with three explldt procedures, the 
synchronization of direct nefghbors, the flooding of the 
level 1 updates, and the forceful announcing of a node's 
presence. The neighbor synchronizaSon pyocedure Is 
designed to rapidly Initiarize the cache of a node that 
ccMTinects to the network. The procedure is also used 



when two nodes decide to make a fornial connection. 
This is an easy way to cure a split. The flooding proce- 
dure ensures that the last level of the cache is consist- 
ent. This is necessary to guarantee that the searches 
5 will actually complete. Finally, the nodes announce their 
presence by looking for an identifier that is very close, 
but not equaJ. to their own Id^tifier, The request is rout- 
ed to an existing neighbor. By doing so, rt is guaranteed 
that the host wiii be known by some related site. This 
-fo win also guarariiee that the flooding procedure is trig- 
gered, 

[0061] Before discussing the actual processing of 
these requests of the protocol of the present invention, 
it is instructive to discuss exempfary message and data 
formats apfH-opri ate In the protocoi of the present inven- 
tion. For example, each node in the network is described 
by an address certificate that contains a node i(ier}tifier, 
the node address, a date until which the certificate is 
valid, and key and signature information. The format of 
20 the key ^d signature InfcHmatlon will vary y/^lh the spe- 
cific implementafion. The Important points are that the 
information is sufficient to prove that the node is a mem- 
ber of the peer-to-peer network, and that the relation be- 
tween the node and the Identifier is genuine. The date 
25 field is used to make sure that flie information is up to 
date as disojssed above with regard to the obsoles- 
cence of cache entries- 

[0062] A request message contains the message 
code, REQUEST, the target of the request, the address 
30 certificate of the origin of the request, the maximum 
npjmber of relays aHowed for this message, and a 
progress list that contains for each node that processed 
the request: the address of the node; and an Indication 
of whether the node accepted or refused the request. 
35 When the request is originated, the requesting node 
sets the message code, the target value, and the ad- 
dress certificate of the origin. The number of nodes is 
set to 1, and the pnagress ilst is initialized to contain ex- 
actly one entry with the adcfress of the origin and an in- 
^0 dication that the request was accepted, A REQUEST 
may also contain a 'best match' certificate to help ensure 
a closest match is relumed in ^e case v^^ere ^ exact 
match is not found. 

[0063} A response message contains the message 
^5 code, RESPONSE, the target of the request, the ad- 
dress certificate of the node that best matched the re- 
quest and a progress list that contains for each node 
that accepted the request and has not yet processed ^e 
response the address of the node. Nodes get removed 
50 off the response list as the message makes Its way to- 
wards the initial requester A fiooding message contains 
the message code, FLOODING, the address certificate 
that IS being flooded, a list of all nodes that have already 
received a copy of the certificate, containing for each 
55 node the address of the node. Nodes get added to the 
Nst as the flooding progresses. A neighbor synchroniza- 
tion request contains the message code, SYNCHRO" 
NtZE, the target of the request, expressed as a node 
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identlHer, and the address certrficafe of the node that 
solicits the neighbor. A neighbor advertisem^t mes- 
sage contains the message code, ADVERTISE^ the up- 
per range of the advertisements expressed as a node 
Identifier, the address certfficate of the node sending the 
advertisement, and a list of entries for which a certificate 
rs available, containing for each entry the identifier of 
the entry, Rnaily, a neighbor synchronfzatron request 
contains the message code, SOLICIT, the target of the 
solfcltatfon, and the address certificate of the node that 
solicits the neighbor. 

10064] Having now described a set of messages ap- 
plicable to the protocol of the present invention, atten- 
tion is now turned to the resolution procedure introduced 
above. Spedfjcafly, the query resolution procedure is 
the process by which unique numbers get resolved to 
addresses. The node that requests a resolution formats 
a request message according to the specification dis- 
cussed above, and forwards that message to the most 
adequate neighbor The node that receive a request 
process It, and can either send back a response, fonward 
the request fo another node, or send back a refusal If it 
cannot process the request. 

[0065] When a node receives a request message, it 
first checks that the certlffcate of the origin rs vaifd. If the 
certificate is invalid, the request wHi be refused. If the 
certificate Is valid, the node updates Its cat^ Infomia- 
tion with the address certificate of the origin according 
to the rules specified above. It will then proceed with the 
message according to the following steps. First, the tar- 
get of the request IS connpared to the local Identltier. If 
the two values are Identicai, the final value has been 
found. The procedure then proceeds to step four, other- 
wise it continues to the second step. Second, the list of 
relays is checked to determine if it aiready contains an 
entry for the host. If this is true, the process proceeds 
to step four Third, the number of nodes in the list of re- 
lays Is checked to determine If it is lower than the 
number of allowed r^ays. If this is false, the process 
proceeds to step four. If this is true, however, an entry 
Is added to the list containing the address of the node 
and an Indication that the node accepted the query. 
Once this is complete, the process then proceeds to 
step four. 

[0066] In step four, if the identifier matched the target, 
or if the number of relaying nodes has already reached 
the allowed number, the node updates the message 
code to RESPONSE and places its own address certif- 
icate as the certificate of the best matching node. If the 
list of relays already contains an entry fw the host, the 
message code is also changed to response, but the host 
does not update the certificate of the best matching 
node. The relay list of the response will only contain the 
fieiaying nodes that accepted the request, if the local 
node Is the origin of the request, the processing Is com- 
plete; otherwise, the message Is relayed to the first entry 
that precedes the local node in the fist of relays and 
whose code indicates that It accepted the request 



[0067] The. node uses the cache information to try to 
find a suitable next hop whose address Is not already 
listed in the fist of relays. If there is a suitable next hop, 
the message Is relayed to that host However, If there Is 
^ no suitable next hop, the entry corresponding to the re- 
laying node Is nrradlfied to Indicate that the request was 
not accepted. If the node is the originator of the request, 
then the request has failed. Otherwise, the message is 
relayed to the first entry that precedes the local node In 
^0 the list of relays and whose code indicates that It accept- 
ed the request. This procedure is designed to place all 
the transaction state Inside the message. As such, in- 
termediate nodes do not have to keep a list of ongoing 
transactions. 

IS [0068] When a node receives a response message, 
it first checks that the certificate of the best match Is val- 
id. If the certificate is invalid, the request is refused. If 
the certificate Is valid, the node updates its cache infor- 
mation with the address certificate of the best match ac- 
20 cording to the procedure discussed above. It then pro- 
ceeds with the message according to the following 
steps. First, if the best match identmer Is not equal to 
the target of the request, and if the local identifier Is clos- 
er to the target than the best match identmer, the node 
2S replaces the best match certificate by the local certifi- 
cate. Second, the node's entry is removed from the relay 
fist If the local node was the first entry in the relay list, 
the request is complete. Otherwise, the response is re^ 
iayed to the last remaining node In the list 
30 The intermediate relays do not need to keep state in or- 
der to execute correctly this protocol 



[0069] Having described the core Peer4o-Peer Name 
Resolution Protocol (PNRP) of the present invention, a 
mechanism to allow resolution of names through PNRP 
3S is now discussed. In summary, these names are 
mapped onto the circular number space discussed 
above through a hash function, e.g, MD6. However, 
there may be multiple entries for a single hash value (e! 
g. In large groups of 10,000 menribers). As such, the 
^0 group will be located on the circular number space 182 
as a single entry 186 as illustrated in FIG, 6, having a 
large group 188 associated th^ewtth. If this were the 
oniy mechanism forthe resolution of names to numbers, 
each node corresponding to that hash would have to 
4S have an enormous cache of all members within the 
group to satisfactorily resolve the search. To overcome 
this limitadon, a unique number is associated with the 
hash of the name as <hash>.<unique number> (<M>. 
<N>). The practical result of this addition is to expand 
50 the circular number space 1 90 to \nciu(ie a mapping of 
each group member With this ext^sion, the core pro- 
tocol discussed above may be used for names as well 
as numbers, and may scale to large groups. 
[0O7O] This peer to peer name resolution protocol 
5^ (PNRP) allows peers to resolve globally unique ID'S into 
peer address certrffcates, A globally unique peer ID Is 
preferably a 128-bit Identifier. Ideally, peer IDs are ran- 
donrily distributed In the peer ID number space, A peer 
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address certlflcafe (PAC) is a collection of data associ- 
ated wi«i a peer ID and contains the peer ID, peer 
stance address, peer friendly-name, fuli public key, and 
a signature which verifies integrity of the entire certifj- 
cate, excluding the public key and derivations of the 
pubKc key. Other data may be rncfuded in the PAC as 
needed. As discussed below, the system of the present 
fnvention utilizes peer IDs. and a category peer ID prefix 
useful for locating arbitrary instances of a class of peer 
resource, 

[0071] Ideal properties f<^ a peer ID scheme include 
fandcHTi distribution, deriyabHIty, security fabler, and 
Histantiabiiity. By random distribution, it Is preferred that 
instantiated peer IDs have a random distribution in the 
peer ID space discussed above. The less ciustered the 
IDs, the better PNRP resolution works. By derivability, 
it is meant the abJilty to generate a peer ID from a com- 
mon, unique friendy name, Derivabllity allows one to 
obtain s peer ro without knowing it in advance. This is 
advantageous because one can remember a more in- 
tuitive name such as an email address easier t\an a nu- 
meric peer ID. The security enabler refers to a peer ID 
composition that discourages identity theft. That is, in a 
preferred embodiment the system of the present inven- 
tion Identity ownership is verifiable. Finaliy. the PNRP of 
the present invention includes a weii-defined mecha- 
nisms for allowing more than one active instance of a 
Peer ID, e.g.. a user's peer ID active on two machfries 
simultaneously. 

[0072] The PNRP keys have two components: the 
identifier of the object, which in our embodiment is a 1 28 
bit number, and the identifier of the instance, which in 
our embodiment is another 128 bit number, typically the 
JPv6 address at which the entry is available. Each PNRP 
entry will contain, in addition to the key, a "proof that 
can be used to check that the entry's pubflsher is entitled 
to puWish the identifier, and to ensure that the informa- 
tion about entries cannot be published against their wiH. 
[0073] I n the simplest form of entries, the object iden- 
tifier is the secure hash of the ixibiic key assodated to 
the entry; lhe instance identifier is the IPv6 address at 
which the object is avalJabie. The proof is obtained by 
signing the combination of identifier and instance with 
the private key associated to this public key. 
[0074] A group based identifier is the secure hash of 
the public key associated to the groi^; the instance 
kientW&rls the iPv6 address at which a group member 
Is present. The proof is obtained by signing the combi- 
nation of identifier and instance with the private key of 
the group member, and then by disclosing a certificate 
signed with the group's private key that assess that links 
the group member*© public key to the group. 
[0075] A user based identifier is the secure hash of 
the claimed identity of the user, expressed as an e-maii 
address, with the secure key of a naming authority. The 
proof fs obtained by signing the combination of identifier 
and instance with the private key of the user, and then 
by disclosing a certlfkiate signed by the naming authority 



that iinks the user name to the corresponding pubiic key. 
Since e-mail addresses must be globally unique ad- 
dresses, the hashing procedure allows for 2^^ unique 
peers before a 50% probability of collision between ex- 
5 tracts, 

[0076J With this extension to the core PNRP of the 
present invention, an individuai host's cache manage- 
ment may become more complicated. That is, if more 
tfian one ID fs stewed on a given host, cache manage- 
fo ment must ensure that good nelghbortioods are main- 
tained for each represented ID Jn the Ideaf case (ignor- 
ing memory and processing), this cache contains 
unique ievels for each ID up to the pc^nt where two ID's 
overiapped at the same cache level, then shared upper 
15 level caches. A cached PAC could have logical mem- 
bership in more than one cache level between different 
represented ID's. 

[0077] The second extension to the base protocol of 
the present invention provides a reai worid Integration 
20 of the peer-to-peer resolution protocol with the domain 
name system (DNS). By provkJIng each node with an 
Identification consisting of a DNS component and a 
unique number, the DNS mechanism can be used to lo- 
cate the centralized server for that DNS component. 
^5 That centralized server may then go into the peer-to- 
peer name resolution protocol (PNRP) ^ce using the 
protocol of the present invenfion with the unique number 
portion to nnd the particular node, and return that infor- 
mation to the requester. The individual node can Had a 
30 neighbor to hefp seed its cache by sending a request to 
the oantr^iized server with a random numb^ 
[007a] Specifically, the PNRP DNS linkage aitaws for 
the resolution of peer kJenllfiers (PriD's) into authorita- 
tive address certificates. This service aiiows subscribers 
35 to obtain the correct address for a connected peer. The 
internet uses DNS for address resolution. It is advanta- 
geous to link DNS to PNRP for name resoiufion. Such 
linkage should enafcde DNS clients to obtain the IP ad- 
dress of a PNRP ciient. using that client's friendly-name 
^ or encoded PriD, it also minimizes the risk of a DNS 
resolver caching anexpired address. Further, it is light- 
weight enough to run on any PNRP subscriber 
[0079J The PNRP DNS gateway will listen for TCP 
and U DP connections on port 53. it will onfy accept que- 
^5 ries with QCLASS=IN, and QTYPE = AAAA or A6 (iPv6 
addresses) or *. The gateway will divide the QNAME Into 
a hostname and a domain sufRx. The domain suffix must 
either be absent, or have 'P2P/ as Its leftmost compo- 
nent. Any other domain sufRx will result In 0 answers. 
50 Preferably, the suffix is made part parametrizabie, a def- 
inition of the naming cloud, 

[0080] When the gateway receives a valid query. It will 
perform up to two PNRP searches on the hostriame. 
First, a search will always be performed upon the results 
55 of the default friendly-name-to-PrlD conveiBion. Prefer- 
ably, this converskjn Is a 128-bit secure hash of the 
friendly-name. Second, if the hostname con^esponds to 
a valid ASCII r^resentation of a hexadecimal PriD, the 
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hostname wilt be converted to a binary PrID, and a 
search for that PrID Jnitiated. Recognizing a need for 
stronger security, a combination of a strong hash and 
secret may be used. If either search returns an address 
certificate which exactrymatcties the query, a DNS A- 5 
record is constructed for the match. llieA-^record TTL is 
set to either 10 minutes or the TTL of the addresscertif- 
icate, whichever is shorter. The response is mariced 
asauthoritatJve, 

lOOaiJ A DNS server may be finked to a PNRP DNS io 
gateway one of two ways. First, a new zone may be ere- 
ated w^fch is a chfid of the DNS server^s authoritative 
zone. For exampfe, NTDEV. MIC ROSO FT.COM au- 
thoritative name server would liave a zone P2P.NTDE\A 
MJCROSOFTCOM with one or nrore NS records point- is 
ing to local PNRP DNS gateways. Second, a new zone 
<P2P> may be created, where <P2P> is an arbitrary val- 
ue, such as for exampfe "pnrp.net" or "p2p,microsoft. 
com". If such a zone rs present In each domain, pointing 
to the closest P NRP DNS gateway, peers may use DNS 
to discover their focal PNRP access point by resolving 
<PrtD>P2R Ideally, each DNS server would have both 
zones defined, to allow both local and global access to 
k>cal P2P networks. 

[0082] An example of this extension of the PNRP of 25 
the present invention to DNS is illustrated in FIG. 7. This 
figure i fiustrates the two spaces, the DNS space 200 and 
the peer to peer apace 202. The linf<age between these 
two spaces fs provided by a server 204 having an ex- 
emplary name of p2p.microsoft-com. A node 206 exist- 30 
ing in the peer to peer space 202 may have an exem- . 
piary name of 123450AF39.ptp.mlcrosoft.com. Alterna- 
tively, the unique number ID may be replaced with a 
friendly name as discussed above with regard to the 
name to number extension to the care protocol. When 35 
a node 208 in the DNS space 200 wishes to find the 2 
node 206 in the peer to peer space 202, it 208 sends a 
DNS query to the .com root server 210. which passes 
the query to the microsoft server 212, which passes the 
query to the ,p2p server 204. This server then uses the 40 
node id and the ixotocol of the present invention to find 
the target node 206 in the peer to peer space 202 as 
discussed above. When the target node 206 is found, 
the address is returned to the requesting node 208 irl 
the DNS space. When a new node 214 wishes to pfug 45 
into the system and seed its cache, it simply sends a 
request for a node having an Id in the form of <random 
numb^>,p2p.microsoft.com. Of course, one skilled in 
the art will recognize that other implementations may 
use a different domain name. e.g. "pnrp.net" instead of so 3. 
"p2p.microsoft.corri." 

[0083] The foregoing description of various embodi- 
ments of the invention has been presented for purposes 
of if lustration and deBcripiion, it Is not intended to be ex- 
haustive or to iimit the invention to the precise embodi- 55 
ments disclosed. Numerous modificaflons or variations 
are possible in light of the above teachings. The embod- 
iments discussed were chosen and described to provide 



the best lllusfration of the princip«es of the invention and 
Its practical applicafion to thereby enable one of ordinary 
sk\\\ in the art to utilize the invention In various embodi- 
ments and with various modifications as are suited to 
the parttcuiar use contemplated. All such modifications 
and variations are within the scope of the invention as 
determined by tiie appended claims when interpreted in 
accordance with the breadth to which they are lairfy. le- 
gally, and equitably entitled. 

Claims 

1- A serverless name resolution protocol through 
which unique numbers are resolved to addresses, 
comprising the steps of: 

receiving at a Ifrst node a request message 
from a request^- fKxJo seeking address resolu- 
tion of a second node having a unique numb^ 
Identifier, the request message including ad- 
dress information of the requester node; 
populating a routing table of the first node witfi 
the address Information of the requests- node; 
analyzing the request message; 
generating a response message to the request- 
er node identifying address information of the 
first node as best matching forthe request mes- 
sage when one of three conditions is met; oth* 
erwise 

determining a suitable next hop forthe request; 
and 

forwarding the request message to the suitable 
next hop. 

The protocol of daim 1 , wherein the step of analyz- 
ing the request message comprises the step of 
comparing the unique number identifier to the ad- 
dress infomiation of the first node, and wherein the 
step of generating a response message to the re- 
quester node identifying address information of the 
first node as best matching Jbr the request message 
when one of three conditions is met comprises the 
step of generating a response message to the re- 
quester node identifying address information of the 
first node as best matching forthe request message 
when the unique number identifier is kienflcaf to the 
address informatiOT of the first node. 

The protocol of claim 1, wherein the request mes- 
sage contains a maximum hop count value and a 
list of node that have processed the request mes- 
sage, and wher&n the step of analyzing the request 
message comprises the step of determining if a 
number of nodes which have previously processed 
the request message exceeds the maximum hop 
count, and wherein the step of generating a re- 
sponse message to the requester node Identifying 
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address information of the first node as best match- 
ing for the request message when one of three con- 
dfttons is met comprises the step of generating a 
response message to the requester node Identify- 
mg address information of the first node as best 5 
matching for the request message when the 
number of nodes which have previously processed 
the request message exceeds the maximum hop 
count. 

4. The protocol of claim 1, wherein the request mes- 
sage contains a Rst of nodes that have processed 
the request message, and wherein the step of an- 
alyzing the request message comprises the step of 
detemiinrng if the address information of the first 15 
node Is in the list of nodes that have processed the 
request message, and wherein the step of generat- 
ing a response message to the requester node 
identifying address Infomiatlon of the first node as 
best matching for the request message when one. 20 
of three conditions is met comprises the step of gen- 
erating a response message to the requester node 
identifi^ng adc^ess Information of the first node as 
best matching for the request message when the 
address information of the first node is in the fist of 25 
nodes that have processed the request message. 

5. The protocol of claim 1, wherein the request mes- 
sage Inciudes a certificate of origin, further comprls-^' 
ing the steps of checking the certificate of origin to 3o 
detemiine its validity, and refusing the request mes- 
sage when ^e certificate of origin ts Invalid, 



6. The protocol of claim 1, wherein the step of popu- 
Jatlng the routing table conprf ses the steps of: 



55 



determining If the address information of the re- 
quester node is aiready in the routing table; 
refreshing the address infbmr»ation of ihe re- 
quester node If more recent than the address 4o 
infornnation of the requester node aiready 
stored in the routing table: else 
computing the distance between the address 
information of the first node and the requester 
node; 

determining from the distance a selected level 
info which to store the address Information of 
the requester node; and 

storing the address information in the selected 
level. 5^ 

r. The protocol of claim 6, wherein the selected level 
rs a last level havmg K entries stored therein, and 
wherein the step of determining the selected level 
comprises the steps of determining that an entry 55 
Should be replaced, and replacing the entry with the 
address information of the requester node. 



8. The protocol of claim 6, wherein the selected level 
is a last level, further comprising the steps of pre- 
paring a flooding message containing the address 
jnfomiation of the first node with an empty fist of al- 
ready flooded nodes, and sending the flooding mes- 
sage lo the requester node. 

9. The protocol of claim 8, further comprising the steps 
of preparing a list of nodes in the routing fable 
whose distance to the requester node is smaller 
than DMAX/(P(t^ % remove from the list nodes that 
are marked as aiready flooded when the addition of 
the new entry Is a result of a iTooding message, pre^ 
paring 3 fiooding message containing the address 
rnfonnation of the requester node, and send the 
flooding message to ail nodes in the list. 

1 0. The protoa)i of claim 6, vtrfherein the selected level 
Is a last level having K or more entries stored there- 
in, and wherein the step of deterniining the selected 
level comprises the step of adding a new level and 
splitting the entries in the last level between the last 
level and the new level accondrng to a distance from 
the address Information of the first node. 

11- The protocol of claim 1, further comprising the steps 
of checking a date of validity for address information 
in the routing table, and removing address Inft^ma- 
tion for which the date of validity has passed. 

12, The prctGccsi of claim 1, wherein the step of deter- 
mining a suitable next hop for the request compris- 
es the steps of Hndlng a subset of routing table en- 
tnes whose address is not aiready listed in the re- 
quest message, returning an Indication of failure 
when the subset is empty, returning a particular en- 
try when the partlcuJar entry is the oniy entry in the 
subset. 



13. The protocol of claim 12, ^rther comfwising the 
steps of finding two entries whose identifiers are 
closest to the second node, randomly pick one of 
the two entries, and return the randomly picked en- 
try. 

14. The protocol of claim 1 . further comprising the steps 

receiving a response message including ad- 
dress Information of the second node and ad- 
dress information of a best match node; 
comparing the address infonnation of the sec- 
ond node and the address infonnation of the 
best match node; 

replacing the address information of the best 
matc^ node with the address information of the 
first node when the address information of ^e 
best match node is not equaJ to the address In- 
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formatfor> of the second node and the address 
information of the first node is doser to the ad- 
dress informatron of the second node than the 
address Information of the best match node; 
and g 
relaying the response message to the request- 
er node when the requester node is not the first 
node. 

15. The protocol of clafm 1, turther comprising the step to 22. 
of formrng the unfque number identifier of the sec* 
ond node by computing a hash of a name of the 
second node. 



16. The protocor of cfaim 15. wherein the step of form- 15 
ing the unique number identifier further comprrsas 
the step of associating a unique number with me 
hash of the name to form the unique number iden- 
tJfler In the form <hash>.<:unique numt)er>. 



randomly replacing another entry with the new 
entry when the level full and when the fevef 
is not a fast level of the multilevel cache; and 
adding a new level when the level is full and 
when the level Is the last level of the multilevel 
cache, sorting entries Itom the last level be- 
tween the last level and the new level, and stor- 
ing the new entry. 

The method of claim 21 , wherein the step of deter- 
mining in which level the entry is to be stored com- 
prises the step of detemiining if the entry already 
exists in the multilevel cache, and replacing an ad- 
dress certificate for the entry when the address cer- 
tificate is more recent than the address certificate 
of the entry stored in the multilevel cache. 



17. The protocol of claim 1 , further comfM-isIng the step 
of extracting the unique numberldentifierof the sec- 
ond node from a unique name processed through 
a DNS query to a peer to peer server, the unique 
name taking the fomri <rpeer to peer identifier>. 25 
<DNS server address>. 

18. The protocol of claim 17, wherein the <peerto peer 
Identifier* is a unique name, further comprising the 
step of forming the unique number identifier of the 30 
second node by computing a hash of a name of the 
second node. 



23. The method of claim 21. wherein the st^ of deter- 
n™nlng in which level the new entry Is to be stored 
20 comprises the step of computing a distance be- 
tween a local identifier and an entfy idenfifl^. 



24. 



19. The protocol of claim 18, wherein the step of form- 
ing the unique number identifier further comprises 
the step of associating a unique number with the 
hash of the name to form the unique number Iden- 
tifier in the form <hash>,<unique number>. 

20. A multilevel cache for use In a peer to peer name 
resolution protocol, comprising a set of L levels, 
each level sl^ed to hold K entries, and wherein the 
numberof levels Lis dynamic, increasing by 1 when 
a K+l entry Is to be abided to a level w^en the 
L^^ level IS occupied by K entries. 

21. A method of dynamically maintaining a multilevel 
cache for use in a peer to peer name resolution pro- 
tocol, cofTHJrlsJng the st^s of: 

receiving a new entry to be entered ffito the mul- 
tilevel cache; 

determining In which level the new entry is to 
be stored; 

determining if the level into which the new entry 
is to be stored is fijil; 

storing the new entry In the level when the level 
is not ftjll; 
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A method of resolving a peer name to a peer ad- 
dress certificate, comprising the steps of computing 
3 hash value of the peer name, associating there- 
with a unique number Jn the fomi <hash>.<unique 
number>. formatting a request message, and trans- 
mitting the request message to a peer node. 



25. The method of dafm 24, further con^rising the 



receiving a response message including the 
peer address certificate and address inf<^ma- 
tion of a best match node; 
comparing the peer address certificate and ad- 
dress Information of a best match node; 
replacing the address information of the best 
match node with the address information of a 
local node when tfie address information of the 
best match node is not equal to peer address 
certriicate and the address Information of the 
local node is closer to peer address certificate 
nian the address information of the best match 
node; and 

relaying the response message to the request- 
er node when the requester node Is not the iocal 
node. 
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